Reinforcement Learning Framework for Modeling Spatial Sequential Decisions under Uncertainty: (Extended Abstract)
نویسندگان
چکیده
We consider the problem of trajectory prediction, where a trajectory is an ordered sequence of location visits and corresponding timestamps. The problem arises when an agent makes sequential decisions to visit a set of spatial locations of interest. Each location bears a stochastic utility and the agent has a limited budget to spend. Given the agent’s observed partial trajectory, our goal is to predict the remaining trajectory. We propose a solution framework to the problem considering both the uncertainty of utility and the budget constraint. We use reinforcement learning (RL) to model the underlying decision processes and inverse RL to learn the utility distributions of the locations. We then propose two decision models to make predictions: one is based on long-term optimal planning of RL and another uses myopic heuristics. We finally apply the framework to predict real-world human trajectories and are able to explain the underlying processes of the observed actions.
منابع مشابه
Examining Extended Dynamic Scripting in a Tactical Game Framework
Dynamic scripting is a reinforcement learning algorithm designed specifically to learn appropriate tactics for an agent in a modern computer game, such as Neverwinter Nights. This reinforcement learning algorithm has previously been extended to support the automatic construction of new abstract states to improve its context sensitivity and integrated with a graphical behavior modeling architect...
متن کاملA Robust Reliable Closed Loop Supply Chain Network Design under Uncertainty: A Case Study in Equipment Training Centers
The aim of this paper is to propose a robust reliable bi-objective supply chain network design (SCND) model that is capable of controlling different kinds of uncertainties, concurrently. In this regard, stochastic bi-level scenario based programming approach which is used to model various scenarios related to strike of disruptions. The well-known method helps to overcome adverse effects of disr...
متن کاملIntelligents et de Robotique Hierarchical & Factored Reinforcement Learning
This thesis is accomplished in the context of the industrial simulation domain that addresses the problems of modelling of human behavior in military training and civil security simulations. The aim of this work is to solve large stochastic and sequential decision making problems in the Markov Decision Process (MDP) framework using Reinforcement Learning methods for learning and planning under ...
متن کاملAutonomous Model Management via Reinforcement Learning: Extended Abstract
Concept drift a change, either sudden or gradual, in the underlying properties of data is one of the most prevalent challenges to maintaining high-performing learned models over time in autonomous systems. In the face of concept drift, one can hope that the old model is sufficiently representative despite concept drift. Alternatively, one can discard the old data and retrain a new model with (o...
متن کاملApplication of truncated gaussian simulation to ore-waste boundary modeling of Golgohar iron deposit
Truncated Gaussian Simulation (TGS) is a well-known method to generate realizations of the ore domains located in a spatial sequence. In geostatistical framework geological domains are normally utilized for stationary assumption. The ability to measure the uncertainty in the exact locations of the boundaries among different geological units is a common challenge for practitioners. As a simple a...
متن کامل